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Abstract 

A property of finite graphs is called nondeterministically testable if it has 
a "certificate" such that once the certificate is specified, its correctness can be 
verified by random local testing. In this paper we study certificatees that consist 
of one or more unary and/or binary relations on the nodes, in the case of dense 
graphs. Using the theory of graph limits, we prove that nondeterministically 
testable properties are also deterministically testable. 

1 Introduction 

Let V be a property of finite simple graphs (i.e., a class of finite simple graphs closed 
under isomorphism). We say that V is testable, if there exists another property T 
(called a test property) satisfying the following conditions: 

• if a graph G has property V , then for all 1 < r < \V(G)\, a random induced 
subgraph on r nodes (chosen uniformly among all such induced subgraphs) has 
property T with probability at least 2/3, and 

• for every e > there is a r £ > 1 such that if G is a graph whose edit distance 
from V is at least e|F(G)| 2 , then for all r e < r < \V(G)\, a random induced 
subgraph on r nodes has property T with probability at most 1/3. 

This notion of testability is often called oblivious testing, which refers to the fact 
that no information about the size of G is assumed. The definition extends trivially 
to graphs whose nodes and/or edges are colored with a fixed finite number k of colors. 

Let L be a graph whose nodes and edges are fc-colored (k > 1). We call such a 
graph briefly a k- colored graph. It will be convenient to assume that the graph G 
itself is complete; we can consider the missing edges as having a further color. Given 
a positive integer m < fc, we can get an ordinary graph from L by keeping only the 
edges with colors 1, . . . , m, and then forgetting the coloring. We will denote this graph 
by L'. If Q is a property of colored graphs, then we define Q' = {L 1 : L e Q}. 

A graph property V is nondeterministically testable, if there exist two numbers 
k > m > 1 and a property Q of fc-colored graphs such that Q is testable and Q' = V . 
In other words, G has property V if and only if we can color the nodes with k colors, 
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the edges with m colors and non-edges with further fc — m colors so that the resulting 
fc-colored graph has property Q. We call such a coloring a certificate for V. 

Instead of a fc-coloring, we could specify fc unary and fc symmetric binary relations 
on V(G) as a certificate (this would be more in the spirit of mathematical logic). The 
fact that in a coloring they are disjoint and partition V(G) and (^g )> respectively, 
can be easily tested. Conversely, such a system of relations defines a 2 fc -coloring. As 
long as we are not concerned with efficiency, these two ways of looking at certificates 
are equivalent. 

Clearly every testable property is nondeterministically testable (choosing k = m = 
1). Our main result asserts the converse. 

Theorem 1.1 A graph property is nondeterministically testable if and only if it is 
testable. 

The proof uses the theory of graph limits as developed in [H [5] , and its connection 
with property testing |10j . 



2 Preliminaries 

2.1 Convergence and limits 

Convergence of a sequence of dense finite graphs was defined by Borgs, Chayes, Lovasz, 
Sos and Vesztergombi [3j [4] . Graphons were introduced by Lovasz an Szegedy in [8] 
as limits of convergent sequences of finite graphs. We have to extend these notions 
to colored graphs; this was essentially done in [TT], but we use here a little different 
terminology. 

For a graph G and integer r < |V(G)|, let us select an ordered r- tuple of nodes 
of G randomly and uniformly (without repetition). Let G(r, G) denote the subgraph 
induced by these nodes. If G is colored, then G(r, G) is also a colored graph in the 
obvious way. 

We say that a sequence of colored graphs L„ is convergent, if |V^(G„)| — > oo, and 
for every r > 1, the distribution of G(r,L n ) tends to a limit as n oo. Note that 
this distribution is over a finite set, so it does not matter in which norm we want it to 
converge. 

The limit object of a convergent sequence of simple graphs can be described as 
a symmetric measurable function W : [0,1] 2 — > [0,1], called a graphon. We will 
need the more general notion of a kernel, a bounded symmetric measurable function 
W : [0, l] 2 -> R. 

For a sequence of fc-colored graphs, the limit object is a bit more complicated: it 
consists of k graphons {W 1 , . . . , W k ) such that J2h W k — 1; and also of a partition 
(coloring) A 1 U • • • U A k of [0, 1] into measurable sets. We call the (2fc)-tuple W = 
(W l ,...,W k ,A x ,...,A k ) a k-graphon. 

Let L be a fc-colored graph with V(L) — [n]. Let J n denote the partition of [0, 1] 
into n intervals J%, . . . ,J n of equal length. We can associate with L a fc-graphon 
W L = (Wl, . . . , W k ,A\, . . . , A k L ), where A\ is the union of intervals Ji for which i 
has color ft, in L, and 



1, if x G Ji, y G Jj , and the color of ij is h, 
0, otherwise. 
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More generally, we can consider a fractionally k-colored graph H in which for the nodes 
are fc-colored, and for every unordered pair of nodes, we have fc nonnegative 

weights j), ... , f3 k (i,j) whose sum is 1. (We could also consider fractional node- 
colorings, but these are not needed to prove our theorem, and they would not lead to 
greater generality.) We consider every fc-colored graph as a special case, where (3 l is 
the indicator function of the i-th edge color. For a fractionally fc-colored graph H , we 
define the fc-graphon Wjj in the obvious way. 

We can sample a fc-colored graph G(r, W) on node set [r] from a fc-graphon W 
as follows: we choose r independent random points X\, . . . ,X r G [0, 1] uniformly; we 
color i € [r] with color h if Xi e Ah, and we color a pair (i,j) with color h with 
probability W h (Xi, Xj) (independently for different pairs of nodes). 

The following fact was proved in 

Proposition 2.1 Let L n be a convergent sequence of k-colored graphs. Then there is 
a k-graphon W such that G(r, L n ) G(r, W) in distribution. 

We write L n — >• W if this holds. 

Let W be a graphon, and let J be a partition of [0, 1] into measurable sets 
Ji, . . . , J m with positive measure. We denote by Wj the graphon obtained by av- 
eraging W in every rectangle Ji x Jj. More precisely, for x £ Ji and y € Jj we 
define 

Wv( X ,y) = J(J ^ ( j- ) J W(u, z )dud z . 

Ji X Jj 

We quote a well-known fact (for a proof see e.g. Lemma 3.5 in [4]): 
Proposition 2.2 For every kernel W , we have Wj n — » W almost everywhere. 

2.2 Many distances 

Convergence to a fc-graphon can be described in more explicit forms. Let us start with 
recalling the cut distance of two graphs G and G' on the same node set V (introduced 
by Frieze and Kannan [7]): 

d a {G,G')= max — , 

S,TCV \V\ 2 

where eg (<!?, T) denotes the number of edges with one endpoint in S and the other in 
T. This can be extended to two fractionally fc-colored graphs H and H 1 on the same 
node set V, with edgeweights (3fi(i,j) and /?^,(i, j) (h — 1, . . . , fc), by the hairy (but 
in fact quite natural) formula 

k 

I I h=l ' - -es 

where D\ is the number of nodes colored differently in H and H' . 
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A related notion for kernels is the cut norm 

\\W\\ n = sup / W(x,y)dxdy. 

S,TC[0,1] J 
SxT 

The cut norm has many nice properties (see [1]), of which we need the following 
(Lemma 2.2 in [TP]: 

Proposition 2.3 Let W n (n = 1,2,...) be a sequence of uniformly bounded kernels 
such that || W n ||n — > 0. Then for every bounded measurable function Z : [0, l] 2 — > M, 
we have \\W n Z\\a — > 0. 

(It would be enough to assume that Z is integrable, and the proof in |10j would work 
for the general case with a little care.) 

The cut norm defines a distance function between two kernels by d\j(U, W) = 
\\U — W\\u- From the point of view of graph limits, however, a kernel is only relevant 
up to a measure preserving transformation of [0,1]. Hence it is often more natural to 
consider the following distance notion, which we call the cut distance: 

8 a (U,W) = ini\\U+-Wrl'\\n, 

where (/>, ip : [0, 1] — > [0, 1] are measure preserving maps, and U^(x, y) = U{<j){x), 4>{y))- 
This defines a pseudometric on the set of kernels (it is only a pseudometric, since 
different kernels may have measure 0). An important fact is that endowing the space 
of all graphons with this pseudometric makes it compact [5J. 

We generalize the cut norm distance to two fc-graphons U = 
ip 1 ,..., U k ,A\ . . . , A k ) and W = (W 1 , . . . , W k , B 1 ,..., B k ): 

k k 

d n (U,W) = ^A(A h AB fe ) + ^||C/ /l -W /l || n . (1) 

h=l h=l 

Similarly as above, we need the "unlabeled distance" 

<5 n (U, W) = inf cfo(U^, W^) (2) 

(where U"^ is obtained from U by substituting <j>(x) for x in each of the 2k functions 
constituting U) . We can use this notion to define the cut distance of two fractionally 
/c-colored graphs H and H' by 

5 D (H,H>) = S D (W H ,W H ,). 

It is easy to see that if V(H) = V(H'), then 

Sa{H,H')<da(H,H'). 

The following result is proved (in a more general form) in |11| . 

Proposition 2.4 Let L n be a sequence of ' k- colored graphs, and let W be a k-graphon. 
Then L n -> W if and only if 6 a {W Ln , W) -»• 0. 
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We cannot claim convergence in da distance, since depends on the labeling 

of the nodes of L n , while the convergence L n — > W does not. However, at least in the 
case of simple graphs, the following stronger version is true ([3], Theorem 4.16): 

Proposition 2.5 Let G n be a sequence of graphs, and let W be a graphon such that 
G n — > W . Then the graphs G n can be labeled so that || Wg„ — W||n — * 0- 

From the point of view of property testing, the "edit distance" is very important 
(in fact, from the point of view of analysis, property testing is about the interplay 
between the edit distance and the cut distance). For two graphs G and G' on the 
same set of nodes V = V(G) = V(G'), their edit distance is defined by 

_ \E(G)AE(G')\ 
di{G,G ) - ^2 . 

We generalize this for two fc-colored graphs L and V on the same node set: 



\V\ \V\^ 

where D\ is the number of nodes colored differently, and D2 is the number of edges 
colored differently, in L and L'. 

For two kernels, their edit distance is just their Li-distance as functions. For two 
fc-graphons, their edit distance is defined by a formula very similar to (JT|): 

k k 

di(U, W) = X(A h AB h ) + \\U h - W h \\L 

h=l h=l 

Similarly to ([2]), we could define the unlabeled version of the edit distance, but we 
don't need it in this paper. 

The following (easy) characterization of testability of graph properties was formu- 
lated in [10], Theorem 3.20. 

Proposition 2.6 A graph property V is testable if and only if for any two sequences 
(G n ) and (G' n ) of graphs such that \V(G n )\, \V(G' n )\ -> oo, Sa(G n ,G' n ) -> and 
G' n G V, we have di{G n ,V) -> 0. 

3 Main proof 

We start with a randomized construction to obtain a fc-colored graph from a frac- 
tionally fc-colored graph H . We keep the color of every node. We color every edge 
ij G ('"') with color h with probability f3 h (i,j). For different pairs i,j we make an 
independent decision. We denote this random fc-colored graph by h(H). 

Lemma 3.1 Let H be a fractionally k- colored graph on n nodes. Then 

1 ni- 

d D (H,UH)) < 
with probability at least 1 — ke~ n . 
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Proof. For two edge colors, this is just Lemma 4.3 in [J]. For general k, it follows 
by applying this fact to each edge-color separately. □ 

The main step in the proof of Theorem 11.11 is the following lemma. 

Lemma 3.2 Let W = (W 1 , . . . , W k ,B 1 , . . . , B k ) be a k-graphon, and let U = 
Tyh=i W h (where 1 < m < k). Let G n be a sequence of graphs such that G n — > U. 
Then there exist k-colored graphs L n on V{G n ) such that L' n = G n and L n — > W. 



Proof. First, we construct a fractionally fc-colored graph H n on V n = V(G n ). To 
keep the notation simple, assume that V n = [n]. We may assume without loss of 
generality that B\, . . . ,Bk are intervals of [0, 1]. By Proposition 12.51 we can choose 
the labeling of the nodes of each G n so that ||Wg„ — U\\a 0. We color node i with 
color h if the left endpoint of the interval J; belongs to interval B h . 
For every pair i, j € [n], we select any x € Ji and yd Jj, and define 



Ai 



W h 7 (x,y) 



/3 h (i,j)=! -Ujn&y) 



W^(x,y) 
±-U Jn {x,y) 



if 1 < h < m, 
if m + 1< h < k. 



where A n denotes the adjacency matrix of G n (these numbers are independent of the 
choice of x and y). It is easy to check that ^ h (i h {i,j) = 1 for all i ^ j. We show that 
for the fractionally fc-colored graph H n constructed this way, we have 



dn(W H „,W)->0. 



(3) 



Let C h denote the union of those intervals Ji for which i got color h, i.e., whose left 
endpoint is contained in B h . Then we have 



dn(Wfl- n ,W) 



'X(B h AC h ) + || -W h 



h=0 



It suffices to prove the convergence for every fixed h. Clearly C h is an interval, and 
the symmetric difference B h AC h is covered by two intervals Ji. So \(B h AC h ) < 2/n, 
and each term in the first sum tends to 0. 

The proof for the second term is trickier. We describe it for h < m; the other case 
is analogous. Since < W h < U, we can write W h — UZ, where < Z < 1, and 
Z = if U = 0. Then we have 



w h \ 



sup 

S,TC[0,1] 



SxT 



Substituting from the definition, 
(Wh„ ~ W h ) = 

SxT SxT 



W 1 } (x,y) 
Uj n (x, y) 



W h (x,y)) dxdy 



G 



We split this integral as follows 

(UZ) Jn , 



w Gn - 



SXT 
17 = 



w G , 



SxT 



(4) 



SxT 



The first term, which is nonnegative, can be estimated as follows: 

(W G „ - U)l u=0 < \\(W Gn - C0ltr=o||a- 



W Gn ( U T ^ Jn < j \V ( .;J,- n 



SxT 
U = 



SxT 



SxT 



Here the right hand side tends to by Proposition 12.31 The second term can be 
estimated like this. By Proposition ^. 21 we have (UZ)j n — > UZ and Uj n — > U almost 
everywhere. Hence (UZ)j n /Uj n — > Z in almost every point where U ^ 0. Since the 
integrand is bounded, this implies that 



W G , 



( (UZ): 

Finally for the third term in Q, we have 



Z 



0. 



SxT 



(W Gn -U)Z < ||(Wb„ - CO^IIn, 

SxT 

and here the right hand side tends to 0, again by Proposition ^. 31 This proves ((3]). 

To complete the proof of the lemma, we consider the graphs L n = h(H n ). By 
Lemma 13-11 we have 

d a (L n: H n )<^ (5) 
V n 

with probability at least 1 — e~ n . Since ^2 e~ n is convergent, the Borel-Cantclli 
Lemma implies that almost surely ([5]) holds for all but a finite number of indices n. 
Choosing the L n so that this occurs, we have da(L n , H n ) = dn(Wi n , Wjy n ) — > 0, and 
hence d n (W L „,W)->0. □ 

Proof of Theorem ll.il Let V be a nondeterministically testable property; we show 
that it is testable. By Proposition [2jj] it suffices to prove that if (G n ) is a sequence of 
graphs such that da^Gn^) —> 0, then di(G n ,V) —> 0. 

Since V is nondeterministically testable, there are integers 1 < m < k and a 
testable property Q of fc-colored graphs such that V — Q! . Let G n £ V such that 
da{G n ,G n ) —> 0. Since G n £ V, there are fc-colored graphs L n e Q such that 
G n = L'n- By selecting a subsequence, we may assume that the sequence (L n ) is 
convergent. Let W be a fc-graphon representing its limit, and let U = Y^h=i W h . 
Then G n U. From d\j(G n , G n ) ->0we see that G n U. 

Now we invoke Lemma 13.21 and construct fc-colored graphs L n such that L' n = 
G n and L n — > W. Hence dp(£ n > Q) —> 0. Since Q is testable, this implies that 
di(L n , Q) — > 0, and so we can change the color of o(n) nodes and o(n 2 ) edges in L n 
so that the resulting fc-colored graph M n belongs to Q. But then M' n g V, and M' n 
differs from G n in o(n 2 ) edges only, so d x {G n , V) < di(G n , M' n ) -4-0. □ 
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4 Applications 



One of the first nontrivial results about property testing concerned the maximum cut. 
Let us derive one version. The property of 2-node-colored graphs that "at least 99% 
of all edges connect nodes with different colors" is trivially testable, and hence: 

Corollary 4.1 The property of a graph that its maximum cut contains at least 99% 
of the edges is testable. 

In [10] , the upward closure of a graph property V was defined as the graph property 
consisting of those graphs that have a spanning subgraph in V . Suppose that V is 

testable, then the property of 3-edge-colored graphs that "edges with color 1 form a 

graph with property V" is testable, and hence: 

Corollary 4.2 The upward closure of a testable graph property is testable. 

Suppose again that V is testable, then the property of 4-edge-colored graphs that 
"edges with colors 1 and 2 form a graph with property V ', and edges with colors 2 and 
3 are fewer than 1% of all edges with colors 1 and 3" is testable, and hence: 

Corollary 4.3 IfV is a testable property, then the property that "we can change 1% 
of the edges to get a graph with property V " is also testable. 

5 Concluding remarks 

There are several possible analogues and extensions of our results. 

One could consider certificates in the form of non-symmetric binary relations. For 
example, the property of a graph that it has a transitive orientation can be certified by 
such an orientation. In this case, one could allow that the original graph is directed. 
The arguments above extend to this case without much difficulty. 

A more substantial extension would be to allow certificates in the form of fc-ary 
relations for any k. One could then allow hypergraphs instead of the original graphs. 
A limit theory for hypergraphs is available (Elek and Szegedy [6 , and we expect our 
main result to generalize to hypergraphs; however, several of the auxiliary results we 
have made use of have not been extended, and a full proof will take further research. 

A generalization in a different direction would be to consider, instead of coloring, 
node and edge decorations from a compact topological space. For example, the prop- 
erty of being a threshold graph can be certified by a decoration of the nodes by numbers 
from [0,1]. The limit theory for graphs has been extended to compact decorations [TT] ; 
perhaps our main result extends too, but this takes further research. 

Finally, let us mention that the situation is quite different in the case of graphs 
with bounded degree (for which a limit theory analogous to the dense case is available, 
and property testing has been extensively studied). Here the sampling method is to 
select r random nodes (uniformly), and explore their neighborhoods of depth r. The 
property of a graph G that "G is the disjoint union of two graphs on at least |V(G)|/3 
nodes" can be certified by coloring the nodes in these two graphs with different colors, 
so this property is nondeterministically testable. On the other hand, sampling will not 
distinguish between an expander graph and the disjoint union of two copies of it, so 
this property is not testable. 
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